Search CORE

40 research outputs found

A semiparametric approach for a multivariate sample selection model

Author: Chavent Marie
Liquet Benoit
Saracco Jerome
Publication venue: Academia Sinica, Institute of Statistical Science
Publication date: 01/01/2010
Field of study

International audienceMost of the common estimation methods for sample selection models rely heavily on parametric and normality assumptions. We consider in this paper a multivariate semiparametric sample selection model and develop a geometric approach to the estimation of the slope vectors in the outcome equation and in the selection equation. Contrary to most existing methods, we deal symmetrically with both slope vectors. Moreover, the estimation method is link-free and distributionfree. It works in two main steps: a multivariate sliced inverse regression step, and a canonical analysis step. We establish pn-consistency and asymptotic normality of the estimates. We describe how to estimate the observation and selection link functions. The theory is illustrated with a simulation study

HAL-Inserm

University of Queensland eSpace

A new sliced inverse regression method for multivariate response

Author: Coudret Raphaël
Girard Stéphane
Saracco Jerome
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013
Field of study

International audienceA semiparametric regression model of a q-dimensional multivariate response y on a p-dimensional covariate x is considered. A new approach is proposed based on sliced inverse regression (SIR) for estimating the effective dimension reduction (EDR) space without requiring a prespecified parametric model. The convergence at rate square root of n of the estimated EDR space is shown. The choice of the dimension of the EDR space is discussed. Moreover, a way to cluster components of y related to the same EDR space is provided. Thus, the proposed multivariate SIR method can be used properly on each cluster instead of blindly applying it on all components of y. The numerical performances of multivariate SIR are illustrated on a simulation study. Applications to a remote sensing dataset and to the Minneapolis elementary schools data are also provided. Although the proposed methodology relies on SIR, it opens the door for new regression approaches with a multivariate response

CiteSeerX

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL-INSU

HAL Descartes

Oskar Bordeaux

Rotation orthogonale en ACP de données mixtes. Le package PCAmixdata et une application en sociologie culturelle.

Author: Chavent Marie
Kuentz-Simonet Vanessa
Lakatos Zoltan
Saracco Jerome
Publication venue: HAL CCSD
Publication date: 02/07/2012
Field of study

Rotation orthogonale en ACP de données mixtes. Le package PCAmixdata et une application en sociologie culturelle

INRIA a CCSD electronic archive server

Oskar Bordeaux

A sliced inverse regression approach for block-wise evolving data streams

Author: Saracco Jerome
Publication venue: HAL CCSD
Publication date: 06/04/2012
Field of study

International audienc

INRIA a CCSD electronic archive server

A sliced inverse regression approach for block-wise evolving data streams

Author: SARACCO Jerome
Publication venue
Publication date: 06/04/2012
Field of study

International audienc

INRIA a CCSD electronic archive server

Oskar Bordeaux

Clustering of Variables for Mixed Data

Author: CHAVENT Marie
SARACCO Jerome
Publication venue: 'EDP Sciences'
Publication date: 01/01/2016
Field of study

This chapter presents clustering of variables which aim is to lump together strongly related variables. The proposed approach works on a mixed data set, i.e. on a data set which contains numerical variables and categorical variables. Two algorithms of clustering of variables are described: a hierarchical clustering and a k-means type clustering. A brief description of PCAmix method (that is a principal component analysis for mixed data) is provided, since the calculus of the synthetic variables summarizing the obtained clusters of variables is based on this multivariate method. Finally, the R packages {\bf ClustOfVar} and {\bf PCAmixdata} are illustrated on real mixed data. The PCAmix (resp. ClustOfVar) approach is first used for dimension reduction (step1) before standard clustering of the individuals (step 2)

EDP Sciences OAI-PMH repository (1.2.0)

INRIA a CCSD electronic archive server

Oskar Bordeaux

BIG-SIR: a Sliced Inverse Regression approach for massive data

Author: LIQUET Benoit
SARACCO Jerome
Publication venue: 'International Press of Boston'
Publication date: 01/01/2015
Field of study

International audienc

INRIA a CCSD electronic archive server

Queensland University of Technology ePrints Archive

Oskar Bordeaux

Variable importance assessment in sliced inverse regression for variable selection

Author: JLASSI Ines
SARACCO Jerome
Publication venue
Publication date: 01/01/2016
Field of study

We are interested in treating the relationship between a dependentvariable

y

and a multivariate covariate

x \in {\R}^p

in asemiparametric regression model. Since the purpose of most social,biological or environmental science research is the explanation, the determination of theimportance of the variables is a major concern. It is a way todetermine which variables are the most important when predicting

y

. Sliced inverse regression methods allows to reduce the space of thecovariate

x

by estimating the directions

\beta

that form aneffective dimension reduction (EDR) space. The aim of this paper isto propose a computational method based on importance variable measure (only relying on the EDR space) in order to select the most useful variables. The numerical behavior of this new method, implemented in R, is studied on a simulation study. An illustration on a real data is also provided

INRIA a CCSD electronic archive server

Oskar Bordeaux